Andreas Krause

mentions: 1 | type: Person

// recent coverage: 1 mention

19:08

2026-06-28

lesswrong.com

ai-safety

Anthropomorphic Misalignment research needs stronger evidence

Researchers at ETH Zurich argue in a new ICML 2026 position paper that AI safety studies on anthropomorphic behaviors like deception and scheming lack rigorous evidence, risking misallocated resources…

// co-occurs with top 7 entities

ETH Zurich (1), Vansh Gupta (1), Peter Nutter (1), Samuel Stante (1), Florian Tramèr (1), Lukas Fluri (1), Xin Chen (1)